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Abstract 

Polar lattices, which are constructed from polar codes, have recently been proved to be able to achieve the capacity 
of the additive white Gaussian noise (AWGN) channel. In this work, we propose a new construction of polar lattices 
to solve the dual problem, i.e., achieving the rate-distortion bound of a memoryless Gaussian source, which means 
that polar lattices can also be good for the lossy compression of continuous sources. The structure of the proposed 
polar lattices enables us to integrate the post-entropy coding process into the lattice quantizer, which simplifies 
the quantization process. The overall complexity of encoding and decoding complexity is 0{N\o^ N) for a sub- 
exponentially decaying excess distortion. Moreover, the nesting structure of polar lattices further provides solutions 
for some multi-terminal coding problems. The Wyner-Ziv coding problem for a Gaussian source can be solved by 
an AWGN capacity-achieving polar lattice nested in a rate-distortion bound achieving one, and the Gelfand-Pinsker 
problem can be solved in a reversed manner. 


I. Introduction 

Vector quantization (VQ) IT] has been widely used for source coding of image and speech data since the 1980s. 
Compared with scalar quantization, the advantage of VQ, guaranteed by Shannon’s rate-distortion theory, is that 
better performance can always be achieved by coding vectors instead of scalars, even in the case of memoryless 
sources. However, the Shannon theory does not provide us any constructive VQ design scheme. During the past 
several decades, many practical VQ techniques with relatively low complexity have been proposed, such as lattice 
VQ in, multistage VQ |3], tree-structured VQ H, gain-shape VQ fS], etc. Among them, lattice VQ is of particular 
interest because its highly regular structure makes compact storage and fast quantization possible. 

In this work, we present an explicit construction of polar lattices for quantization, which achieves the rate- 
distortion bound of the continuous Gaussian source. It is well known that the optimal output alphabet size is infinite 
for continuous-amplitude sources. Particularly, the rate distortion function for the Gaussian source of variance cr^ 
under the squared-error distortion measure d{x,y) = \\x — j/|p is given by 

i?(A) = maxjilog (1) 

where A and R denote the average distortion and rate per symbol, respectively. However, in practice, the size 
of the reconstruction alphabet needs to be finite. Unconstructively, E Theorem 9.6.2] shows the existence of 
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a block code with finite number of output letters that achieves performance arbitrarily close to the rate-distortion 
bound. Then a size-constrained output alphabet rate-distortion function was defined in [Tl with M denoting 

the size of output alphabet. The well-known trellis coded quantization (TCQ) HI was motivated by this alphabet 
constrained rate-distortion theory. It was shown that for a given encoding rate of R bits per symbol, the rate- 
distortion function i?(A) can be approached by using a TCQ encoder with rate i? -f 1 after an initial Lloyd- 
Max quantization. It is equivalent to the trellis coded modulation (TCM) in the sense that m information bits 
are transmitted using 2"*+^ constellation points. A near-optimum lattice quantization scheme based on tailbiting 
convolutional codes was introduced in M- Despite enjoying a good practical performance, a theoretical proof of 
the rate-distortion bound achieving TCQ with low complexity is still missing. More recently, a scheme based on 
low density Construction-A (LDA) lattices ifTOl was proved to be quantization-good (defined in Sect. HID i using the 
minimum-distance lattice decoder. However, in practice the ideal performance cannot be realized by the suboptimal 
belief-propagation decoding algorithm. 

Polar lattices have the potential in solving this problem with low complexity. As shown in im, this class of lattices 

allows us to employ the discrete Gaussian distribution for lattice shaping. This distribution shares many similar 

properties to the continuous Gaussian distribution and obtains the optimal shaping gain when its associated flatness 

factor is negligible. We may use the discrete Gaussian distribution instead of the continuous one as the distribution 

of the reconstruction alphabet. This idea has already been proposed in ifT^ for random lattice quantization. It is 

also shown in HD that even using binary lattice partition, the number of the partition levels r does not need to be 

very large (O(loglogW)) to achieve the capacity ilog(l + SNR) of the additive white Gaussian noise (AWGN) 

channel, where SNR denotes the signal noise ratio. By the duality between source coding and channel coding, 

the quantization lattices can be roughly viewed as a channel coding lattice constructed on the test channel. For a 

Gaussian source with variance a1 and an average distortion A, the test channel is actually an AWGN channel with 

noise variance A. In this case, the “SNR” of the test channel is and its “capacity” is ilog(^), which 

2 

implies that the rate of the polar lattice quantizer can be made arbitrarily close to ^ log(^). Therefore, based on 
this idea, we propose the construction of polar lattices which are good for quantization in this work. We note that 
the difference between the quantization polar lattices and the AWGN channel coding polar lattices not only lies 
in the construction of their component polar codes, but also in the role of their associate flatness factors. For the 
AWGN channel coding polar lattices, the flatness factor is required to be negligible to ensure a coding rate close to 
the AWGN capacity and it has no influence on the error cotTection performance. For the quantization polar lattices, 
however, the flatness factor affects both the compression rate and the distortion performance. This is also the reason 
why the lattice Gaussian distribution can be optimal for both channel coding and quantization simultaneously (see 
Remark [D, and consequently be utilized for Gaussian Wyner-Ziv and Gelfand-Pinsker coding. 

A. Our contribution 

The novel technical contribution of this paper is two-fold; 

• The construction of polar lattices for the Gaussian source and the proof of their rate-distortion bound achieving. 
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This is a dual work of capacity-achieving polar lattices for the AWGN channel, and it can also be considered as 
an extension of binary polar lossy coding to the multilevel coding scenario. Compared with traditional lattice 
quantization schemes 113, m, which generally require a separate entropy encoding process after obtaining 
the quantized lattice points, our scheme naturally integrates these two processes together. The analysis of these 
quantization polar lattices prepares us for the further discussion of Gaussian Wyner-Ziv and Gelfand-Pinsker 
problems. 

• The solutions of the Gaussian Wyner-Ziv and Gelfand-Pinsker problems, which consist of two nested polar 
lattices. One is AWGN capacity-achieving and the other is Gaussian rate-distortion bound achieving. The two 
lattices are simultaneously shaped according to a proper lattice Gaussian distribution. Note that the Wyner-Ziv 
and Gelfand-Pinsker problems for the binary case have been solved by Korada and Urbanke using nested 
polar codes. However, in the Gaussian case, the problems mrn out to be more complicated as the Wyner-Ziv 
bound becomes lower (the Gelfand-Pinsker capacity becomes larger by duality). As mentioned in fTh), the 
extremely severe conditions iflhl eq. (12)] and iflhl eq. (18)] for the bound, which corresponds to the scenario 
where both encoder and decoder know the side information, can be satisfied in the Gaussian case rather than 
the binary case because of infinite alphabet size, meaning that more effort should be made for the Gaussian 
case. As a result, our polar lattice coding scheme achieves the whole region of the Wyner-Ziv bound and has 
no requirement on the signal noise ratio for the Gelfand-Pinsker capacity. 

B. Relation to Prior Works 

As mentioned above, although the TCQ technique performs well in practice, its theoretical limit is still unclear, 
to the best of our knowledge. Polar lattices, as we will see, can be theoretically proved to be able to achieve the 
rate-distortion bound. Moreover, thanks to their low complexity, considerably high-dimensional polar lattices are 
available in practice, providing a quantization performance with gap less than 0.2 dB to the achievable bound when 
the lattice dimension N = 2^®. 

The sparse regression codes were also proved to achieve the the optimal rate-distortion bound of i.i.d Gaussian 
sources with polynomial complexity iflTll . lITSl. In fact, there exists a trade-off between the distortion performance and 
encoding complexity. For a block length N, typical encoding complexity of this kind of codes is 0((A^/log A^)^) 
for an exponentially decaying excess distortion with exponent 0(A^/logA^), and their designed random matrix 
incurs N x storage complexity. In comparison, the construction of polar lattices is as explicit as that of 

polar codes themselves, and the complexity is quasi-linear 0{N\o^ N) for a sub-exponentially decaying excess 
distortion with exponent roughly 0{'/N)- 

The saliently nesting structure of polar lattices also gives us solutions to the Gaussian Wyner-Ziv and Gelfand- 
Pinsker problems. According to the prior work by Zamir, Shamai and Erez m, EQI, the two problems can be 
solved by nested quantization-good and AWGN-good lattices. However, due to the lack of explicit construction of 
such good lattices, no explicit solution was addressed. A practical scheme based on multidimensional nested lattice 
codes for the Gaussian Wyner-Ziv problem was also proposed in ED. The performance of this scheme can be 
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very close to the Wyner-Ziv bound but a theoretical proof is still missing. A lattice-based Gelfand-Pinsker coding 
scheme using repeat-accumulate codes, which were concatenated with trellis shaping, was also presented in ll22ll . 
This scheme was shown to be able to obtain a very close-to-capacity performance. Unfortunately, the complexity 
grows exponentially to achieve the shaping gain and a theoretical proof for the Gelfand-Pinsker capacity-achieving is 
also missing. In this work, we solve these problems by combining the recently proposed AWGN capacity achieving 
polar lattices im and the rate-distortion bound achieving ones. 

C. Outline of the paper 

The paper is organized as follows: Section II presents the background of polar codes and polar lattices. The 
construction of rate-distortion bound achieving polar lattices is investigated in Section III. Some simulation results 
of polar lattices for quantization are given in Section IV. In Section V and Section VI, the solutions of the Gaussian 
Wyner-ziv and Gelfand-Pinsker problems are addressed accordingly, by combining the AWGN capacity achieving 
polar lattices and the proposed quantization polar lattices. The paper is concluded in Section VII. 

D. Notations 

All random variables (RVs) will be denoted by capital letters. Let Px denote the probability distribution of a RV 
X taking values a: in a set X. For multilevel coding, we denote by X(, a RV X at level 1. The i-th realization of 
Xi is denoted by x\. We also use the notation as a shorthand for a vector {x\, ...,xj), which is a realization of 
RVs X^'^ = (Xf ...,Xf). Similarly, xf^ will denote the realization of the Ath RV from level i to level j, i.e., of 
Xf^ = iXf ...,Xj). For a set I, \X\ represents its cardinality. For an integer N, [A^] denotes the set of all integers 
from 1 to N. !(•) denotes an indicator function. Let I{X\Y) denote the mutual information between X and Y. 
The notations I{X; V)+ and R ^ I{X; Y)~ will be used to represent a rate approaching I{X;Y) from the 
right side (equal or greater than I{X;Y)) and the left side (equal or less than I{X;Y)), respectively. Throughout 
this paper, we use the binary logarithm and information is measured in bits. 

IT Background 

A. Lattice Codes and lattice Gaussian distribution 

A lattice is a discrete subgroup of M" which can be described by 

A = {A = Bz : z € Z"}, (2) 

where the columns of the generator matrix B = [bi, • • • ,bn,] are assumed to be linearly independent. 

For a vector x G R", the nearest-neighbor quantizer associated with A is Qx{x) = argmin^gA ||A — x||, where 
ties are resolved arbitrarily. We define the modulo lattice operation by x mod A = x — Qa{x). The Voronoi region 
of A, dehned by V(A) = {x : Qa{x) = 0}, specihes the nearest-neighbor decoding region. The volume of a 
fundamental region is equal to that of the Voronoi region V(A), which is given by U(A) = |det(B)|. 
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For cr > 0 and c € K", we define the Gaussian distribution of variance centered at c as 

1 


f<T,c{x) = 


-e j X G 


{\p2^ay 

Let fafi{x) = fa{x) for short. The A-periodic function is defined as 

1 




r-A|| 


e 2a 

AGA / AGA 

We note that fa,A{x) is a probability density function (PDF) if x is restricted to the fundamental region TZ{A). This 
distribution is actually the PDF of the A-aliased Gaussian noise, i.e., the Gaussian noise after the mod-A operation 


The flatness factor of a lattice A is defined as 


€A{cr) = max |L(A)/„,A(a;) - 1|. 
xen{A) 

We define the discrete Gaussian distribution over A centered at c as the discrete distribution taking values in 
A e A: 


DA,a,c{^) 


/o-,e(A) 

/..c(A)’ 


VA G A, 


(3) 


where /cr,c(A) = J2\£A convenience, we write D\ ,j = A^a.ct.o- It has been proved to achieve the 

optimum shaping gain when the flatness factor is negligible ll25l . 

A sublattice A' C A induces a partition (denoted by A/A') of A into equivalence groups modulo A'. The order of 
the partition is denoted by |A/A'|, which is equal to the number of the cosets. If |A/A'| = 2, we call this a binary 
partition. Let A(Ao)/Ai/ • • • /Ar_i/A'(Ar) for r > 1 be an n-dimensional lattice partition chain. If only one level 
is applied (r = 1), the construction is known as “Construction A”. If multiple levels are used, the construction is 
known as “Construction D” ^ p.232]. For each partition A^-i/A^ (1 < ^ < r) a code Ct over Ai-i/Ai selects a 
sequence of coset representatives ag in a set Ag of representatives for the cosets of Ag. This construction requires a 
set of nested linear binary codes Cg with block length N and dimension of information bits kg which are represented 
as [N, kg] for 1 < i < r and Ci C C 2 ■ ■■ Q Cr- Let ijj be the natural embedding of into Z^, where F 2 is 
the binary field. Consider gi, g 2 , • • • , gjv be a basis of F^ such that gi, • • • gke span Cg. When n = 1, the binary 
lattice L consists of all vectors of the form 




g=i j=i 


+ 2" 


(4) 


where G {0,1} and z G Z^. 


B. Polar codes and polar lattices 

Polar codes are the first kind of codes that can be proved to be able to achieve the capacity of any binary 
memoryless symmetric (BMS) channel. Let W{y\x) be a BMS channel with input alphabet X = {0,1} and output 
alphabet A’ C K.. Given the capacity I{W) of W and any rate i? < I{W), the information bits of a polar code 
with block length W = 2™ are indexed by a set of RN rows of the generator matrix Cn = [1 where 0 
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denotes the Kronecker product. The matrix Gn combines N identical copies of to l^jv. Then this combination 
can be successively split into N BMS subchannels, denoted by with 1 < i < By channel polarization, 
the fraction of good (roughly error-free) subchannels is about I(W) as m —oo. Therefore, to achieve capacity, 
information bits should be sent over those good subchannels and the rest are fed with frozen bits which are known 
before transmission. The indices of good subchannels can be identified according their associate Bhattacharyya 
Parameters. 

Definition 1 (Bhattacharyya Parameter for Symmetric Channel l[26\l}: Given a BMS channel W with transition 
probability Py\x, the Bhattacharyya parameter Z G [0,1] is defined as 

Z{W) =EVPY\xiy\o)PY\xiy\i)- 
y 

Based on the Bhattacharyya parameter, the information set I is defined as {i : and the frozen 

set P is the complement of I. Let Pg denote the block error probability of a polar code under SC decoding. It 
can be upper-bounded as Pb < An efficient algorithm to evaluate the Bhattacharyya parameter of 

subchannels when W is a binary Ag-i/Ai channel has been presented in ifTTl . based on the prior work on the 
construction of polar codes 1271, m. 

A polar lattice is constructed by using a set of nested polar codes as the component codes in (|4|i. An explicit 
construction of AWGN-good polar lattices (defined in ifTTl ') based on the multilevel approach of Forney et al. ll2^ 
has been presented in im. The key idea is to design a polar code to achieve the capacity for each level ^ = 1, 2,..., r 
in Construction D. 

To achieve the capacity of the power-constrained Gaussian channel, we need to apply Gaussian shaping over 
the AWGN-good polar lattice, which is considered to be difficult to do directly. Motivated by 1251 . we can apply 
Gaussian shaping to the top lattice A instead. This shaping process generally leads to a nonuniform input distribution 
and an binary memoryless asymmetric (BMA) channel for each partition. In this case, we need the recently 
introduced polar codes for asymmetric channels. 

Definition 2 (Bhattacharyya Parameter for BMA Channel ^291? . KlOi}: Let kF be a BMA channel with input 
X G X = {0,1} and output Y G y, and let Px and Py\x denote the input distribution and channel transition 
probability, respectively. The Bhattacharyya parameter Z for channel W is the defined as 

z{x\Y) = 2Y,PY{y)^JPx\Y{0\y)Px\Y{l\y) 

y 

V 

Note that Definition |2] is the same as Definition [T] when Px is uniform. 

Let X^'^ and be the input and output vector after N independent uses of W. We have the following 
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property of the polarized random variables Gn- 


lim [i ■ < 2"^'' and Z{W\U^'-^-^) > 1 - = I{X-,Y), 

7^00 A' I J \ 


N —¥oo 

lim — 
N^oo N 


|i : Z{W\U^'-^-\Y^'-^) > 2-^^ or Z{W\U^'-^-^) 


< 1-2 


-w'^l 


(5) 


= 1-I{X-,Y), 


which provides us a method of achieving the capacity of a BMA channel. Moreover, The Bhattacharyya parameter 
of a BMA channel can be related to that of a BMS channel, and the decoding of a polar code for the BMA channel 
can also be converted to that for the BMS channel. (See ifTTIl for more details.) 


III. Polar Lattices for Quantization 

Let Y ^ N{0, (Tg) denote a one dimensional Gaussian source with zero mean and variance cr^. Let (Y) be 
N independent copies of Y and (y) be a realization of Y^'^. The PDF of Y is given by friv) = fasiu)- For 
an A^-dimensional polar lattice L and its associated quantizer (3 l( ), the average distortion A after quantization is 
given by 


^ = 4 / \\y-QL{y)\\^ fY{y)dy. 

A Jrn 

The normalized second moment (NSM) of a quantization lattice L is defined as 

. . wIv(L)M^dv 
^ ’ L(L)l+2/N ’ 

where vector v is uniformly distributed in V{L). 

Definition 3: An A-dimensional lattice L is called quantization-good ifBl if 

lim G(L) = -. 

Af->-oo 27re 


( 6 ) 


(7) 


( 8 ) 


InlH, an entropy-coded dithered quantization (ECDQ) scheme based on quantization-good lattices was proposed 
to achieve the rate-distortion bound O. This scheme requires a pre-shared dither which is uniformly distributed 
in the Voronoi region of a quantization-good lattice and an entropy encoder after lattice quantization. For our 
quantization scheme, we will show that dither is not necessary and the entropy encoder can be integrated in the 
lattice quantization process, which brings much convenience for practical application. 

Our task is to construct a polar lattice which achieves the rate distortion bound of the Gaussian source with 
reconstruction distribution /—^ —r. Following the notation of AWGN-good polar lattices im, we use X to 


A.y'o-S-A 

denote the reconstruction alphabet. Firstly, we prove that the rate achieved by D 




can be arbitrarily close 


to i log(^). Note that the following theorem is essentially the same as Theorem 2 in ||25]| . Here we just reexpress 
it in the source coding formulation. 

Theorem 1 ( (ESI?).- Consider a Gaussian test channel where the reconstruction constellation X has a discrete 
Gaussian distribution Da-c,(t^ for arbitrary c G M", and where cr^ = cr^ — A with A being the average distortion. 
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t > 1 /e 
0 <t < 1 /e 


Let CTA — 2 vV 5 Then, if e = eaCcta) < h and < e where 

(Tg ' ^ Z i — €t 


A 

Et = 



(f '* + l)eA(crr/ 



(9) 


the discrete Gaussian constellation results in mutual information /a > 5 log(^) — ^ per channel use. 

The statement of Theorem [T] is non-asymptotical, i.e., it can hold even if n = 1. Therefore, it is possible to 
construct a good polar lattice over one-dimensional lattice partition such as Z/2Z/4Z.... The flatness factor e can 
be made negligible by scaling this binary partition. This technique has already been used to construct AWGN 
capacity-achieving polar lattices ini. 

Note that when the test channel is chosen to be an AWGN channel with noise variance A and the reconstruction 
alphabet is discrete Gaussian distributed, the source distribution is not exactly a continuous Gaussian distribution. 
In fact, it is a distribution obtained by adding a continuous Gaussian of variance A to a discrete Gaussian 
which is expressed as the following convolution 


fY'{y') 




( 10 ) 


where a = ^fK and Y' denotes the new source. For simplicity, in this work we only consider one dimensional 
binary partition chain (n = 1) and Y' is also a one dimensional source. 

Therefore, we are actually quantizing source Y' instead of Y using the discrete Gaussian distribution. However, 
when the flatness factor is small, a good quantizer constructed from polar lattices for the source Y' is also 

good for source Y because of the following lemma. The relationship between the quantization of source Y' and Y 
is shown in Fig. 1. 


A(0 

,A) 

1- ] 

Y~ P 1 

^ 1 V y> f 

i ^ 

U \ ^ Jy 

Similarly 

test channel ^ distributed 

\TLf\ — 2\ 1 

^ I ■» y rr 

ly (u, cT,. ; j ^ 

i A 

k * 

N{0 

,A) 


Fig. 1. The relationship between the quantization of source Y' and Y. 


Lemma 1 ( HS]/ ): If e = the variational distance between the density fy/ of source Y' defined in 

(Ho} and the Gaussian density fy satisfies 'V{fy', fy) < 4e. 

Now we consider the construction of polar lattices for quantization. Firstly, we consider the quantization of source 
y' using the reconstruction distribution DA,crr- Since binary partition is used, X can be represented by a binary 
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String Xi:r, and we have liinr_>.oo Pxi r = Px = D/^ a-r- Because the polar lattices are constructed by “Construction 
D”, we are interested in the test channel at each level. Similar to the setting of shaping for AWGN-good polar 
lattices ifTTI . given the previous and the coset Ag determined by xi-g, the channel transition PDF at level £ 

is 


_ EaeA,{x,..,)Pia)PY'\Aiy'\a) 

P{Ae{xi.,i)} 

y'^ \_ 1 _ 1 

2{a‘^ + A)] fa^{Ag{xi,i)) 2TT\PKas ^ 




( 11 ) 


2 

where a = is equal to the MMSE coefficient and aA = Consequently, using D\^cr^ as the constellation, 

the £-th channel is generally asymmetric with the input distribution Pxe\Xi.i_i < f), which can be calculated 
according to the definition of D\^crr- 

The lattice quantization can be viewed as lossy compression for all binary testing channels from level 1 to r. 
Here we start with the first level. Let denotes the realization of N i.i.d copies of source Y'. Although Y' 

is a continuous source with density given by (fTOl l and y' is drawn from R, to keep the notations consistent (the 
definition of the Bhattacharyya parameter ll26l is given by a summation), from now on we will express the distortion 
measurement as well as the variantional distance in the form of summation instead of integral. 

Since the test channel at each level is not necessarily symmetric and the reconstruction constellation is not 
uniformly distributed, we have to consider the lossy compression for nonuniform source and asymmetric distortion 
measure. The solution of this problem has already been introduced in and it turns out to be similar to the 
construction of capacity-achieving polar codes for asymmetric channels. 

For the first level, letting Ul'^ = XI'-^Gn, where Gn is the N x N generator matrix of polar codes, we define 
the information set Ii, frozen set Pi and shaping set Si based on the Bhattacharyya parameter as follows: 

' J-i = G [N] : ,Y'^'-^) > 1 - 2-^'"} 

< = {i G [N] : Z{Ui\Ul'-^-^) > 2-^" and Z{Ul\Ul'-'^-^,Y'^'-^) < 1 - 2"^''} (12) 

Si={i& [TV] : Z{Ul\Ul'-^-^) < 2-^^}. 

The shaping set 5i is determined by the distribution Pxi- Note that this definition is similar to that in ifTTl 
Equation (21)]. The difference is that Ii is designed to be slightly larger to guarantee a desired distortion level. 
The asymmetric Bhattacharyya parameter Z[U{\Ui'^~^,Y'^'^) and Z{U{\Ui'‘^~^) can be efficiently calculated 
by symmetric Bhattacharyya parameter Z{ij{\Ul'''~^,Xl'^ 0 and Z(jj{\Ul'''~^,Xl'^ 0 

respectively (see in for more details). According to m Theorem 5], the proportion of set Xi approaches I{Xi \ Y') 
when N ^ oo. 

After getting Pi, Ii and iSi, for a source sequence y'^'^, the encoder determines ul'^ according to the following 


November 3, 2015 


DRAFT 










10 


rule; 


0 w. p. P, 


^n-.N (Olu 


l:i-l 


U, = 




1 W. p. 


l:i-l „,n:N 


if i G Ii, 


(13) 


and 


(14) 


u\ if j G Pi 

argmaxPj^i|j^i;i-iif i G 5i. 

_ ^ 
Here u\ is a uniformly random bit determined before lossy compression. The output of the encoder at level 1 is ^ = 

{u\, i G Ii}. To reconstruct x\'^, the decoder uses the shared and the received to recover according to 

argmaxPj^i|j^i;i-iand then a;}'^ = u\-^Gn- The probability and Pjji^jji-.i-i yn-N can both 

be calculated efficiently by the successive cancellation algorithm with complexity 0{N \ogN) ifTTIl . 

Theorem 2: Let Qiji-n yo-n{u\'^ denote the joint distribution for Ul'^ and according to the 

encoding rule described in (fOl) and (fT4l i. Consider another encoder using the encoding rule (fOl l for all i G [N] 
and let Piji.n yo:n{u\'^ denote the resulted joint distribution. For any /3' < ^ < 1/2 satisfying (fTSli and 

Pi = M >/(Xi;r'), 

,YO:N,Qij1-.N ^YO:n) = 0{2 ^ ). (15) 

The same statement has been given in ll^ yet without proof. Here we prove the theorem in Appendix for 
completeness. 

Now we introduce the construction for higher levels. Taking the second level as an example, to make up the 
reconstruction constellation distribution, the input distribution at level 2 should be Px 2 \Xi- Based on the quantization 
results {Ui '^given by the encoder at level 1, some U 2 {U^'^ = X^'^Gn) is almost deterministic given 
(t /2 Since there is a one-to-one mapping between Xl'^ and Ul'^, given (C/^ is the same 

as given We define the information set I 2 , frozen set P 2 and shaping set ^2 as follows; 

'^2 = {t G [N] : Z{U^2\U2'~\xI-^X^''^) > 1-2-^'"} 

I 2 = {i G [N] ; Z{U^\U^'-*-^,Xl'-^) > 2-^'’ and Z{W 2 \U 2 '~\ < 1 “ 2”^''} (16) 

[ 52 = {* G [TV] ; Z{U^\U^--^-\xl--^) < 2"^"}. 

The proportion of I 2 approaches /(-V 2 ; Y\Xi) when N is sufficiently large ifTTl . For a given source sequence pair 


(«! , 2 /'^'^) or (a;i , 2 /'^'^), the encoder at level 2 determines 


U 9 = 


0 W.p. ^yo--n{Q\u2 

1 w.p. y,l:Ar(l|U 2 


according to the following rule; 

i.-i 

if i G I 2 , (17) 


,a;i 


l:t-l l-.N /l:^r^ 

xi ,y ) 


and 


Un = 


U 2 if * G p2 

argmaxPyi|j^i;i-i if i € 82 - 


(18) 
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We further extend Theorem |2] to the second level. 


Theorem 3: Let Qiji.n jji-.n yo:n{uI'^,u\'^ denote the joint distribution for and {Ul'^ 

according to the encoding rule described in (fTTl l and (fTSl) . Consider another encoder using the encoding rule (fTTl i for 
all i £ [N] and let Pjji-.n jji-.n yo-N , u\'^, denote the resulted joint distribution. For any /3' < /3 < 1/2 

satisfying ( fT^ and i ?2 = > I{X 2 ;Y'\Xi), 




(19) 


Note that Theorem is based on the assumption that y/i-.N ^ Qjji.n y/i-.w ) = 0(2 ), which means 

that we also need Ri > I{Xi;Y'). Therefore, we have X]i=i > ^(^ 1 -^ 2 ; 1^0- 

By induction, for level I (i < r), we dehne the three sets J/, Xi and Si in the same form as (fThl l with 
replacing and Ui replacing U 2 - Similarly, the encoder determines = x\'^G n) according to the rule 

given by (fTTl) and (fTSl l. with Xy^_-^ and replacing and x\'^, respectively. Let Qjji-n yn-.w [uy^, y'^'^) 

denote the associate joint distribution resulted from this encoder and Pjji.n yn:N{uy^,y'^'^) denote the one that 
resulted from an encoder only using (fTTI l for all i £ [N], We have V(Pj/i;n yn-.N,Q jji-.n yn-.N) = 0{l ■ ) 

for any rate Ri = > I{Xi;Y'\Xi.i-i). Specihcally, at level r, for any rate Rr > I{Xr;Y'\Xi.r-i) and 

LLi > I{Xi,r;Y'), we have 




r^yn 


r,Q 


C/il'f ,V'1 


0=O(r 


2 -^"'). 


( 20 ) 


By ifTTl Lemma 5], I{Xyj.\Y') is arbitrarily close to I{X\Y') when N is sufficiently large and r = O(loglogX), 
which gives us Y{Pjji-.n^ y,i,N ,Qui-.n y,i,N) = 0{2~^^ ). 

Now we present the main theorem of this section. The proof is given in Appendix |C] 

Theorem 4: Given a Gaussian source Y with variance and an average distortion A < cr^, for any rate 
2 

R > ilog(^), there exists a multilevel polar code with rate R such that the distortion is arbitrarily close to 
A when A —^ 00 and r = 0(loglog A). This multilevel polar code is actually a shifted polar lattice L + c 
constructed from the lattice partition A/A' with a shaping according to the discrete Gaussian distribution Pa. a,., 
where = -^/cr^ — A and the partition chain is scaled to make ca (0. 

Remark 1. From the proof of Theorem 01 it seems that R could be slightly smaller than 2 log ^ (since R > 
I{X]Y') > 2 log ^ to reach an average distortion A, which would contradict Shannon’s rate-distortion 

theory. However, this is not the case. When R < ^log-^, an arbitrarily small eA(CTA) cannot be guaranteed, 
which means that the resulted distortion cannot be arbitrarily close to A. To achieve the desired distortion, we need 
i? > 2 log ^ for all possibly small eA(o’A), which leads to i? > 2 log 


IV. Simulation Results 

The quantization performance of polar lattices for a Gaussian source with standard deviation CTs = 3 and target 
distortion from 0.1 to 2.5 is shown in Fig.|2] It reveals that the rate-distortion bound is approached as the dimension 
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of polar lattices increases from N = to N = 2^®. Particularly, when N = 2^®, the gap to the rate-distortion 
bound is less than 0.2 dB. 


3.5 - ' 


3 - 


2.5 - 


cd 

Qi 


2 - 


1.5 - 


1 - 


0.5 


\ 






\ 


NH'sir " 

<12:* . ««• , 




— 

Rate-Distortion bound 

— 

Scalar quantization 

0 

Polar lattice with N=2** 

□ 

Polar lattice with N=2*^ 

A 

Polar lattice with N=2*^ 


Polar lattice with N=2’^ 

* 

Polar lattice with N=2**^ 








■* 


0.5 1 1.5 2 

Average Distortion 


2.5 


Fig. 2. Quantization performance of polar lattices for the Gaussian source with as = 3. 




(a) V(/y/,/y) when r = 6 and A = 0.5. (b) Quantization noise v.s. Gaussian noise when A = 0.1. 

Fig. 3. Variational distance V(/y/,/y) and quantization noise for polar lattice quantizer. 


In this work, the number of levels is chosen to be 6 to guarantee a negligible variational distance V(/y',/y) 
for all target distortions. For a target distortion A = 0.5, the two densities of Y' and Y are compared in Fig. 
|3(a)[ where negligible difference between fy and fyi is found since V(/y',/y) ~ 1.1 x 10“^. Moreover, the 
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quantization noise behaves similarly to a Gaussian noise as shown in Fig. |3(b)| which will be useful to understand 
the idea of Gaussian Wyner-Ziv coding and Gelfand-Pinsker coding in the next section. 

2 

A performance comparison between the TCQ and polar lattice for quantization is shown in Table |T] The SNR = ^ 
is shown in dB. For the TCQ, the dimension is 1000 and the number of states is 256. For the quantization polar 
lattice, the dimension is 1024. It can be observed that the performance of the polar lattice is superior to that of 
the TCQ with roughly same block length (especially for higher rate). The performance of the Lloyd-Max scaler 
quantizer is also shown. 


TABLE I 

Performance comparison with TCQ for Gaussian source (SNR in dB). 


Rate (bits) 

TCQ 

Polar Lattice Quantizer 

Lloyd-Max Quantizer 

Rate-Distortion Bound 

1 

5.56 

5.59 

4.40 

6.02 

2 

11.04 

11.55 

9.30 

12.04 

3 

16.64 

17.57 

14.62 

18.06 


V. Gaussian Wyner-Ziv Coding 

A. System model 

For the Wyner-Ziv problem, let X, Y be two joint Gaussian source and X = Y + Z, where Z is a Gaussian 
noise independent of Y with variance A typical system model of Wyner-Ziv coding for the Gaussian case is 
shown in Fig. |4] Given the side information Y, which is only available at the decoder’s side, the Wyner-Ziv rate- 
distortion bound on source X for a target average distortion A between X and its reconstruction X is given by 

i?tvz(A) = maxjilog (21) 


B. A solution using continuous auxiliary variable 


To achieve this bound, we assume a continuous auxiliary Gaussian random variable X' which has an average 
distortion A' with source X, i.e., X' = X + N{0, A'). Then we can also obtain that X' = Y + N{0, A' -f cr^)- 
Letting cr^, be the variance of X', the difference between the mutual information I{X']X) and I{X';Y) is given 
by 


I{X'-X)-I{X'-Y) 




1 , X' + al 

2 iQg A' • 


( 22 ) 


^For a more general Wyner-Ziv model in the Gaussian case, the relationship between the two joint source can also be y = X Z, where 
Z N{0, (j^) is a Gaussian noise independent of X. In this case, we can perform the MMSE rescaling on Y to make X = oiY + Z, where 
a = -^ and Z is with variance Then the Wyner-Ziv bound is given by Ry/z{^) = max | T log , 0|. Therefore, the 

system model can still be described by Fig. [4| with Y and Z being replaced by cxY and Z, respectively. 
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> Encoder 


X ~ iV(0, C7^) ^0^ y ~ ^^(0, ^r) 


Decoder 


•X 


Z-N{Q,al) 


Fig. 4. Wyner-Ziv coding for the Gaussian case. The vaiiances of Z, Y and X are given by cr^, ay and a"^ = a^ + a^, respectively. 


Let /(X';X) — I{X';Y) = Rwzi^) and assume A < cr^. Then we have 

A' = ^A, (23) 

^2 

where rj — Note that t] is the reciprocal of the MMSE rescaling parameter in the scenario of quantizing a 

Gaussian source with variance a\ for a target average distortion A. 


r’~x(o,CT>A') 


-X~N{Q,al) 


■ y~n{Q,ct;) 


Q~NiO,A') 

R-p achjeving code 


Z~N{Q,al) 

Capacity achieving code 


Fig. 5. A solution of the Gaussian Wyner-Ziv problem using a continuous Gaussian random variable X'. 


The above-mentioned solution for the Gaussian Wyner-Ziv problem is depicted by Fig. |5] Firstly we design a 
lossy compression code for source X with Gaussian reconstruction alphabet X'. The average distortion between X' 
and X is A' = tyA. Then we construct an AWGN capacity achieving code from Y and X'. The final reconstruction 
of X is given by X = F + ^(X' — Y). Clearly ^(X' — F) is a scaled version of the Gaussian noise, which is 
independent of X'. The variance of ^(X' — F) is 


4 (A' 


2\ _ ^ 

- — + -Y 


A 
rj rj 

^z 

cfI — A. 


A- 


- A 




(24) 


Then we can check that X = X + X(0, A), which corresponds to the desired distortion, and the required data rate 

is /(X';X)-/(X';F) = ilog(4). 
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a„X' 


X~N{Q,al) 


7V(0,a A') 


(a) 




■l^~iV( 0 , al) 


N(Q,a^{X+al)) 

(b) 


Fig. 6. The MMSE rescaled channel blocks (a) and (b) for the Gaussian channels X X' and Y X', respectively. 


C. A practical solution using lattice Gaussian distribution 

The problem of the above-mentioned solution is that X' is a continuous Gaussian random variable, which is 
impractical for the design of lattice codes. In order to utilize the proposed polar lattice coding technique, X' is 
expected to obey a lattice Gaussian distribution. To this end, we perform MMSE rescaling on X' for the AWGN 
channels X ^ X' and Y X' , respectively. The rescaled channels are shown in Fig. |6] where 


and 


Oi-a — 


Qc — 




al + X' a2(a2-A)+a2A’ 
al {al-al){al-X) 


(25) 


(26) 


o-J + A' al{al - X) + alX' 

Clearly, ac < ctq- To combine the two blocks in Fig. |6] together, block (6) is scaled by Consequently, a reversed 
version of the solution illustrated in Fig. |5] is obtained and shown in Fig. |7] For the reconstruction of X, we have 
the following proposition. 

Proposition 1: To achieve the Rwz{X) bound by the reversed structure shown in Fig. |7] the reconstruction of 
X is given by 

-^A 


X = aqX' + j{^Y -aqX'), 7 = 


(27) 


Proof: It suffices to prove that X = A + A(0, A). According to Fig. |7] we have X = aqX' + A(0, a^A'), 
meaning that showing X = aqX' + N(Q, aqA' — A) would complete this proof. 

Clearly, ^Y — aqX' is a Gaussian random variable with 0 mean and variance QfgA' + ^cr^, and it is independent 
of X'. By substituting the parameters A', aq and etc, we have 


,A' - A = 




(t^A^ 

y 


A- A 


72(7?- (7,2A’ 


(28) 

(29) 
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and 


Ol-r 


as we desired. 






V'^x(CT^-A) + Cr2Acr2-A cr2 




7^ — (J^A 
z y 


'y J 




CTxO-2 - 0 - 2 A’ 


“A' —0—.r-wio.A) —0 


-^Y 


Q'-NiO, a A') 


Z'-N{0,^cjI) 
cc„ 


i D tC 

ri r r 

rfl 


1 

1 

i c 

u 

< 

r 

z 

a 

1 

R-D achieving 



Capacity achieving L 2 


(30) 

(31) 

(32) 

□ 


Fig. 7. A reverse solution of the Gaussian Wyner-Ziv problem, which is more compatible with lattice Gaussian distribution. 


Now the continuous Gaussian random variable aqX' can be replaced by a lattice Gaussian distributed variable 
A ^ where cr^ = Let X = A + N{Q,aqA') and = X + N{Q, Let B = and 

q ,2 _ _ 

(T^ = -^(Jy for convenience. By Lemma[Tl the distributions of X and Y can be made arbitrarily close to those of X 
and Y, respectively. Then the polar lattices are designed by treating X as the source and Y as its side information. 
A rate-distortion bound achieving polar lattice Li is constructed for source X with target distortion a^A', and an 
AWGN capacity-achieving polar lattice L 2 is constructed to help the decoder extract some information from Y, as 
shown in Fig. |7] Finally, the decoder reconstructs X = A + — A). Conceptually, B — A is a Gaussian noise 

which is independent of A. Recall that 7 = scales i? — A to A(0, a„A' — A). By Lemma [T] again, the 
distributions of X and X can be very close, resulting in an average distortion close to A. 

When lattice Gaussian distribution is utilized, by HU Lemma 10], Li and L 2 are accordingly constructed for 

^In fact, when A is reconstructed by the decoder, — A is not exactly a Gaussian noise -/V(0, since the quantization noise 

of Li is not exactly Gaussian distributed. However, according to Theorem [4| the two distributions can be made arbitrarily close when N is 
sufficiently large. See Fig. |3(b)|for an example. 
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the MMSE-rescaled Gaussian noise variance az and az, where 

H ^ 


;;.2 _ ^ A' — 

U rp 


ata 


?-a2A’ 


and 


Since A < cr^ 


= -|KA' + ^a2) = 


we also have < a^. 




— (7^A 

z y 




(33) 


(34) 


Fig. 8. The partitions of for quantization lattice Li (left) and channel coding lattice L 2 (right). C , Xp C , and C . 

■^Q 

Without the side information, Uf ‘ should be sent to achieve the target distortion. With the side information, however. Up ^ can be decoded 


and hence only need to be sent. 


Now we are ready to introduce the polar lattice coding strategy. The idea is similar to the one mentioned in ifTSll . 
We choose a good constellation -Da, 0-2 such that the flatness factor eA(CTq) is negligible. Let A/Ai/ • • • /A^-i/A'/ • • • 
be a one-dimensional binary partition chain labeled by bits AxjAij ■ ■ ■ j Ar-ij Ar j ■ ■ ■. Then Pai.^ and 
approach Pa, 0-2 and A, respectively, as r —00. Consider N i.i.d. copies of A. Let Ul'^ = A\'^Gm for each 
1 < (. <r. The partitions of U}'^ for both Li and L 2 are shown in Fig. 0 where the left block is for quantization 
lattice Li and the right one for channel coding lattice L 2 - According to Section Hill and IfTTI . for 0 < /3 < 0.5, the 
frozen set (Z'f^), information set if {if) and the shaping set Sf (Sf) for lattice Li (L 2 ) are given by 

' Tf = {t e [N] : > 1 - 2"^"} 

< if = [TV] : Z{Ul\Uf'‘-\A]:.’f_f > 2-^^ and Z{Ul\Uf^-\ A];:f_^, < 1 - 2"^''} (35) 

_ = p G [N] : Z{Ufuf^-\Al[f_f < 
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and 

' J-f = {* G [TV] : > 1 - 2"^"} 

< If = {i G [TV] : > 1 - 2-^^ and Z(C/f C/f < 2"^''} (36) 

5f = {ie [TV] : Z{Ul\Ul-^-\A];:^_^) < 1-2"^'' or 2"^'' < i, < 1-2-^'’}. 

For these two partitions, we have the following lemma. 


Lemma 2: Let Li and I 2 be two polar lattices constructed according to the above two partition rules respectively. 
L 2 is nested within Li, i.e., L 2 C Li. 

Proof: Both Li and L 2 follow the multilevel lattice structure (|4]l. Let {Cf,..., Cg,..., C]?} and {Cf,..., Cf,..., C^} 
denote the multilevel codes for Li and L 2 , respectively. When shaping is not involved, the generator matrixes of 
Cg and Cg correspond to the sets of row indices if U5f and if U5f, respectively. By the relationship 
and ns Lemma 3], the partition channel C{Ag_i/Kg,d1) is degraded with respect to C(A^_i/A^, ct^). Then by 
the equivalence lemma fflS Lemma 10], we have if C if, meaning that Cg C for 1 < £ < r. As a result, 
L 2 ^ EH 

By channel degradation, we have if C if. Let dFg denote the set if \ if. Meanwhile, we have Sf C 5f 
by definition. Denoting by dSg the set Sf \ Sf, dSg can be written as 

dSg = {i G [TV] : 2-^" < Z(C/f < 1 - 2"^'’ or 

2-^' < Z{Ul\Ul-^-^,A\-.^_g,B^--^) <l-2-^'}, (37) 


and the proportion ^ 0 as TV —)> 00 . Also observe that dig = if \ if = dTg U dSg. 

Given an TV-dimensional realization vector of , the encoder evaluates Ug'^ from level 1 to level r 
successively according to the random rounding quantization rules given in Section|III] (See (fOT l. (fTS l. (fTTI i and (fTSi) .') 


Recall that treating as a realization of is safe because X and TV are similarly distributed. Then u\ 


dX^ 


IS 


sent to the decoder for each level. For the decoder, the realization vector y^'-^ of is scaled to 


TTS: 

•X n 


Since Ug‘ is shared between the encoder and decoder before transmission, after receiving Ug Ug^ and Ug‘' 
can be decoded with vanishing error probability since their associate Bhattacharyya parameters are arbitrarily small 
when N ^ 00 . The details of SC decoding for Gaussian channels have been discussed in CD. According to CD 

can be evaluated with 




Lemma 8], probabilities 

gC _ 

O(TVlogTV) complexity. It is worth mentioning that Ug‘ is covered by a pre-shared random mapping in llllll . 

gQ 

However, as shown in Theorem |2] replacing the random mapping with MAP decision for Ug^ will not change the 
results of CD Theorem 5] and CD Theorem 6]. The proof is similar to that of Theorem |2] and omitted for brevity. 
Then the whole vector Ug'^ can be recovered with high probability. After obtaining the Ug'^ for 1 < £ < r, the 


realization of can be recovered from Ug^ according to the following equation 




i=i 


+ Y + Y 

.i^Xg i^Tt 


(38) 
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where gi denotes the z-th row of the polarization matrix Gn and ijj is the natural embedding. Clear is drawn 
from f? 2 ''Ziv+x,o'a- For s^ch dimension, when r is sufficiently large, the probability of choosing a constellation 
point outside the interval [—2’’“^, 2’’“^] is negligible (see uni Lemma 5] for more detail). Therefore, there exists 
only one point within [—2’’“^, 2’’“^] with probability close to 1 and can be recovered by x mod 2’’. Finally, 
the reconstruction of is given by x^'^ = 

To sum up, we have the following Wyner-Ziv coding scheme. 

• Encoding: For the A^-dimensional i.i.d. source vector X^'-^, the encoder evaluates U^? by random rounding. 


and then sends Uf^'^ to the decoder. 


Decoding: Using the pre-shared and the received the decoder recovers U-‘'^ and from the 

side information . For each level the decoder obtains , then A^'-^ can be recovered according to 


dXi 


tS? 


• Reconstruction: 

With regard to the design rate, by TheoremlH the rate of Li can be arbitrarily close to i log However, 
the encoder does not need to send that much information to the decoder because of the side information. By uni 
Theorem 7], the rate of L 2 can be arbitrarily close to ^ log ^ 2 )- After some tedious calculation, we 


have 


and 


1 / CT^tr; 


P 1 , / - CTyA 

Rl2 ^ log ( 


2 - V aj 

-2 + 


(39) 


(40) 


meaning that the transmission rate — Rl^ —4log(^) 

Before presenting the main theorem of the Gaussian Wyner-Ziv coding, we need a more stringent requirement 
on the flatness factor. This requirement is to guarantee a sub-exponentially decaying error probability for our lattice 
coding scheme. 

Proposition 2: For a one-dimensional binary partition chain A/Ai/ • • • jK^-ilR'! ■ ■ ■ and any given a, r = 
0(\ogN) is sufficient to guarantee a sub-exponentially vanishing flatness factor €A{d') = Moreover, the 

mutual information of the bottom level I{X\ Ar\Ai-r-i) 0 and using the first r levels only incurs a capacity 
loss 

Proof: Since the partition is with dimension one, we can assume that A = 77Z. Let A* = be the dual lattice 
of A. By ||24l Corollary 1], we have 


eA(o') = 0A.(27ro-^) - 1 

= ^ exp(—27r^cr^ 

AgA* 

= 2 exp(—27r 


2^2 


2)-l 

IIAf) 


(41) 

(42) 

(43) 


AG:; 
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2exp(—27r^a-^^) 

- 1 _ exp(-2^2^2% ) 

< 4exp(-27r^a-^^), 

rjz 

where denotes positive integers and the last inequality satisfies for sufficiently small t]. Let ^ = 0[-\/N) and 

hence = 0(2“'^). In addition, by ifTTl Lemma 2], a number of levels ri = O(loglogA^) is needed 

to guarantee a vanishing mutual information at the bottom level. Let 5Z/---/2’'iZ be a partition such that 

I{X\ Ar\Ai:j.-i) 0 for a constant 5. Finally, the number of levels for partition pZ/ • • • /(5Z/ • • • /2’''^Z satisfies 
r = log(^) = 0(logAf). □ 

The following theorem is proved in Appendix |Dl 

Theorem 5: Let AT be a Gaussian source and Y be another Gaussian source correlated to X as X = Y + Z, 

where Z ^ 7V(0, tr^) ™ independent Gaussian noise. Consider a target distortion 0 < A < for source X 

when Y is only available for the decoder. Let A/Ai / • • • /A,, be a one-dimensional binary partition chain such that 

^A{^q) = 0(2“'^) and r = 0{\ogN). For any 0 < j5' < P < 0.5, there exists two nested polar lattices Li and L 2 

2 

with a differential rate R = Rl^ — Rl^ arbitrarily close to ^ log(^) such that the expect distortion Aq satisfies 

Aq< A-fO(2-'^'''), (46) 

and the block error probability satisfies 

pwz ). (47) 

VI. Gaussian Gelfand-Pinsker coding 


(44) 

(45) 


M' 


Encoder 




•© 


¥■ 


Decoder 




Z~N{Q,c7l) 


Fig. 9. Gelfand-Pinsker coding for the Gaussian case. 


For the Gelfand-Pinsker problem, with some abuse of notations, consider the channel described by F = X-\-S+Z, 
where X and Y are the channel input and output, respectively, Z is an unknown additive Gaussian noise with 
variance and S is an interference Gaussian signal with variance af known only to the encoder. A diagram of 
Gelfand-Pinsker coding is shown in Fig. |9] Message M is encoded into X which satisfies the power constraint 
< P. The channel capacity of this Gaussian Gelfand-Pinsker model OTI . Il32l is given by 


CGP = il0g(l + ^). 
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To achieve this capacity, the roles of quantization lattice and channel coding lattice are reversed. To see this, 
we still start with a continuous auxiliary variable and then replace it with a discrete Gaussian distributed one. 
Letting p = we firstly design a lossy compression code for pS with Gaussian reconstruction alphabet S'. 

The distortion between S' and pS is targeted to be P, i.e.. S' = pS -\- N{0,P). Then the encoder transmits 
X = S' — pS (X is independent of S), which satisfies the power constraint. Moreover, the relationship between Y 
and S' is given by 

S' = X + pS 

= X + p{Y-X-Z) (48) 

= pY + {l- p)X - pZ. 

Then the expectation 


E[Y ■ [(1 - p)X - pZ]] = (1 - p)E[X^] - pE[Z^] = 0, 


(49) 


meaning that (1 — p)X — pZ is independent of Y, which gives S' = pY + iV(0, pj^‘ 2 . )■ Then we construct an 
AWGN capacity-achieving code to recover S' from pY. Without the power constraint, the maximum data rate 
that can be sent is actually I{S';pY). However, when power constraint is taken into consideration, some bits 
should be selected according to the realization of S since S' and S are related. The maximum data rate becomes 
I{S'; pY) — I{S'\pS) — 4 log(l + = Cap- A diagram of this solution is shown in Fig. [TOl where 

and 


^ P + 

are the variances of S' and Y, respectively. 


(Ji — cr,- 


P 


(51) 


L 


X = S'-pS 


S' 


© 


P(T^ 

7V(0, —^) 

p+cj: 


Capacity achieving code 


ort-e- 

I 

I A'CO, 


pS 


P + cjt 


R-D achieving code 


Fig. 10. A solution of the Gaussian Gelfand-Pinsker problem using continuous Gaussian random variable S'. 


Similarly, we prefer to use a discrete lattice Gaussian distributed version of S' to approach this capacity. The 
idea is to perform MMSE rescaling on S' to get a reversed version of the model shown in Fig. [Tol The analysis 
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is similar to that presented in Section |V] and is omitted here for brevity. Finally, the reversed solution is given in 

Fig.M 


a,S' 


■e- 


-^pY 

a„ 


©■ 


pS 


PcT^ 

Q'- N(0, Z ' ~ N{0, 

« P + a: P + o-^ 






1 1 

1 

_ ^ 

y a ^ 

r 

z 

1 

Capacity achieving 



R-D achieving L 2 


Fig. 11. A reverse solution of the Gaussian Gelfand-Pinsker problem. 


With some abuse of notation, let A denote the discrete version of aqS'. The MMSE rescaling factor Oc for 
channel coding and ag for quantization are given by 


Per?, 


(Xr — 


and 


Pa2 + (P + a2)2’ 


Pa? 


(52) 


(53) 


^ Pa2 + (P + a2)2’ 

respectively. The variance a\ for is chosen to be a^cr^,. Polar lattices L\ and are accordingly constructed 


for Gaussian noise variance a? and dg, where 


.2 ala?, 

a^ = -• —^ 


and 


O' 

^ 

~ 2 

P 


Pal 


(54) 


(55) 


Check that ^ < 1. Recall that X = S' — pS={l — ag)S' + UgS' — pS. When agS' is replaced by A, the 

encoded signal, denoted by X, is given by 


X = -—‘^A + A-pS. 


(56) 


Note that the distributions of S and S can be arbitrarily close when eA(o’c) —5^ 0. Clearly, A — pS is a Gaussian 
random variable independent of A with distribution N{0,agP). By Lemma [T] X can be very close to a Gaussian 
random variable with distribution V(0,P).[^ Thus, the power constraint can be satisfied. 

^Check that = (1 - 
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With some abuse of notations, let B = ^pY, B — ^pY, T = pS, and T = pS for convenience. We choose 
a good constellation f?A,cr2 such that the flatness factor eA(a’c) is negligible. Let A/Ai/ • • • /Aj._i/A'/ • • • be a 
one-dimensional binary partition chain labeled by bits A 1 /A 2 / ■ ■ ■ lAr-ijArl • • •. Then Pax,. and Ai-r approach 
D\,rj 2 and A, respectively, as r — 00. Consider N i.i.d. copies of A. Let = A\-^Gn for each 1 < £ < r. 

The partition of is shown in Fig. [12] where the left block is for quantization lattice L 2 and the right one for 
channel coding lattice Li. For 0 < /3 < 0.5, the frozen set jpf information set if {if) and the shaping set 

Sf (Sf) for lattice L 2 (LA are given by 



Fig. 12. The partitions of for quantization lattice L 2 (left) and channel coding lattice Li (right). C and C S^. Without the 

2^C -jQ 

power constraint, ^ can be sent as message bits. With the power constraint, however, ^ should be selected according to the interference 

and hence only can be fed with the message bits. 


' Pf = {i& [Af] : Z{Ul\Uf^-\A\:^_^,f^--^) > 1 - 2-^"} 

< if = {i^ [iV] : Z{Ul\Uf^-\A\-.^_A > 2-^'" and Z{Ul\Uf^-\< 1 - 2"^''} (57) 

_ = {2 G [N] : Z{Ul\Uf^-\A]:-,f_A < 2"^"}, 


' Pf = {ie [TV] : Z{Ul\Uf^-\A\[f_„B^--^) > 1 - 2"^"} 

< if = {ie [TV] : Z{Ul\Uf^-fA];-.f_A > 1 - 2-^^ and Z{UfUf^-f A];:f_^, B^''^) < 2"^''} (58) 

sf = {iG [Af] : Z{Ul\Uf^-\Al:f_A < 1-2"^'' or 2"^'' < Z{Ufuf^-fAl:f_^,B^'-^) < 1-2-^'’}. 

By channel degradation, we have pf C pf. Let dPi denote the set pf\pf. Meanwhile, we also have Sf C Sf. 

The difference dSi = Sf \ Sf can also be written as O, and the proportion —>• 0 as TV —)> c». 
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Given an A/^-dimensional realization vector of 5^'^, the encoder scales to = ps^'^ and evaluates 

j-C 

u\'^ from level 1 to level r successively according to the random rounding quantization rules. Note that ^ is 
uniformly random and known to the decoder, and is fed with message bits which are also uniform. Recall that 
treating as a realization of T^'-^ is reasonable because T and T are similarly distributed. Then can be 
obtained for 1 < f < r. When r is sufficiently large, the lattice points outside [—2’'“^, 2’'“^] occur with almost 0 
probability, the realization of A^'-^ can be determined by u]^.^ according to (l3^ . Then — ps^'^ 

is the encoded signal as discussed in (|56] ). 

For the encoder, the realization vector of is scaled to ■ The task is to recover u/ 

djr 

at each level and hence message ^ can be obtained. Note that * is shared between the encoder and decoder 

gQ 

before transmission, and U(^ can be decoded with vanishing error probability using the bit-wise MAP rule. The 
decoder still needs to know the unpolarized bits since the Bhattacharyya parameters of those indices are not 

gQ 

necessarily vanishing. Therefore, a code with negligible rate is needed to send to the decoder in advance at 
each level. This two phases transmission method has already been used in lITSl . In this sense, L 2 is not exactly 

I c X^ 

nested within Li because of those unpolarized indices. When Ug‘ is also available, Ug‘ can be decoded with very 
small error probability im with 0{N log N) complexity. 

The Gaussian Gelfand-Pinsker coding scheme is summarized as follows. 

• Encoding; According to the A^-dimensional i.i.d. interference vector S^'^, the encoder evaluates by 
random rounding, and then feeds with message bits. is pre-shared and is determined by other 
bits according to For each level, the encoder obtains for 1 < £ < r, then is recovered from 

. The encoded signal is given by 


Xl:N ^ _ pgl:N_ ( 59 ) 

Qiq 

• Decoding; Using the pre-shared and the bits by the two phases transmission, the decoder recovers 

'tC oQ 

U ‘ including the message bits and from the received signal. 

For the rate of lattice codes, we have 


and 




Rl. ^^log[^ (P + ^2)2 


indicating that the Cap- 

The proof of the following theorem is given in Appendix |E] 


(60) 


( 61 ) 


Theorem 6: Let S' be a Gaussian noise known to the encoder, and Z be another independent and unknown 
Gaussian noise with variance cr^. Consider a power constraint P for the encoded signal. Let A/Ai/ • • • / A^ be a 
one-dimensional binary partition chain such that eA(<5’c) =0(2“'^) and r = 0(log A^). For any 0 < P' < P <0.5, 
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there exists two nested polar lattices Li and L 2 with a differential rate R = — Rl 2 arbitrarily close to 

5 log(l + such that the expect transmit power Pt satishes 

Pt<P + 0{2-^^'), (62) 

and the block error probability satisfies 

P°^ <0{2-^^'). (63) 

VII. Concluding Remarks 

In this work, we present an explicit construction of polar lattices which are good for lossy compression. They are 
further utilized to resolve the Gaussian version of the Wyner-Ziv and Gelfand-Pinsker problems. Compared with 
the original idea given in IT^ . dither is not necessary in our scheme due to the property of discrete lattice Gaussian 
distribution ll25l . and the entropy encoder is already integrated in our lattice quantization process. The complexity 
of encoding and decoding complexity is 0{N \og^ N) for a sub-exponentially decaying excess distortion. 


Appendix A 
Proof of Theorem[2] 

Proof: Firstly, we change the encoding rule for the u\mi& Si and (O is modified to 


u\ if i & Pi 


0 w. p. P, 




Ui\Uf 


(64) 


1 W. p 1) 


if i ^ S\. 


Let Q'jji n \r,i N (^1 5 denote the associate joint distribution for Ui and according to the encoding 

,r 

rule described in (fTsT l and (l64l i. Then the variational distance between Pjji-.n yr.N and Q'jji.n yi;w can be bounded 
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as follows. 

mPu l.N ^y/liN ,Qjj 1 ,N y'/l;iv) 

= \Q'iui’^^y'^''^)-Pi^i^^y 


n-.N\ 




N 


(b) 


i-1 

i-l 


j=i+l 

N 


iG.FlUSl j = l j=i+l 

iGJ^lUSl i = l 


2G.7^lUtSl y/l;iV 


(c) 


< E! E! ^('*^1 5 2/ ) Y^21n2£l(Pj^i|jy^l:i-l^^l;i-l y,i.JV_.y/l:N- | | Qf/j | f f 1 ■ ’ “ 1 =„ 1 ^ “ 1 y/I: W =„/l; W 


iGJ^lU»Sl 1^^-'^-'^ ^y/l:N 




id) 


— E E -^(^1 * 2/^^'‘^)-D(Pyi|[;l:i-l^„l:t-l_y,l,iV^y/l,iV ||Q(7i|(7l:i-l^„lii-l_y/l:Ar^j^,l:Ar) 

:.FlUSl y „M-1 _j,/ 1:N- 

E v^2iEEE^^E^EE>^ 


2G.7^iUtSi 


< E \/21n2(l -77(t7{|27i^*-\y'i^")) + E v^21n2(iJ(C/*|C/i^*-i) - i7(C/{|C/i^*-\y'l^^)) 




2 G»Si 


? E \/21n2(l - + E \/21n2(Z(27*|C/i^^*"^) - Z(C/*|C/i^*-\ri^")2) 




2G5 i 


< 2Af\/41n2 • 2-^'’ = 0(2"^'''), 

where D(-||-) is the relative entropy, and the equalities and the inequalities follow from 
(a) The telescoping expansion 1^ . 

(&) Q'{u\\u]^-^~^,y'^'^) = for i € ly 

(c) Pinsker’s inequality. 

(d) Jensen’s inequality. 


(e) Q'{u\\ui ) — ^ (^1 is uniformly random) for i G Ti and Q'^ 


l-.i-l\ _ 1 
1 

(/) Z{X\Y)^ < H(X\Y) < Z{X\Y). 

(g) m- 
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Following the same fashion, 

2V((5[/1:JV y,l:iV , ^ y ,1; W ) 

< E 721n2f?(Q^.||Q'^,|C7i-\F'i^^) 

i^Si 

< E \l2\n2Z{Ul\Ul'-^-^) < N'j2\n2 ■ 2"^'’ = 0{2-^^'), 

i^S\ 

where inequality (h) follows from the MAP decision in (fl4ll for i & Si. 

Finally, we have 

nPu 1:N y'^-N , QjjI'.N y'^-n) 

< V(-Pyl;-N y/liW, Q^1:N- y/l;w) + V((5y 1; W ,i; w , Qy 1;N y/1: W ) 

= 0{2-^^'). 

Clearly, when N goes to infinity, for any R > = I{Xi-,Y'), V(P( 71 :n y/i:Ar,y,i:N^) is arbitrarily small. 

□ 


Appendix B 
Proof of Theorem[3] 

Proof: The variational distance can be upper bounded as follows. 

2 V(P( 7 l:iV y,l:N , Qjyl ^ : N y , 1 ; W ) 

= E \Q{u2^ 

.^l^.Nyn-.N 

.^i^.Nyn-.N 

i-.N ,,/l:Ar 

L ’y 


< 


(65) 


Treating (Ul '^as a new source with distribution P{u\'^,y'^'^), the first summation can be proved to 
be 0{2~^ ) in the same fashion as the proof of Theorem |2] For the second summation, we have 




( 66 ) 




= 2Y{Pij1-.N y/l:JV , QjjI-.N y/l:iV ) = 0(2 ). 
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Finally, 




□ 


Appendix C 
Proof of Theorem|4] 


Proof: Firstly, for the source Y', we consider the average performance of the multilevel polar codes with all 
possible choice of at each level. If the encoding rule described in the form of (fTTl l is used for all i S [N] at 
each level, the resulted average distortion is given by 


DpY' = 


1 

N 


E 


Pri 


’uff ,YO:- , M G^)), 


where A4(ul:^G]v) denotes a mapping from to according to (l3^ (remind that x is drawn from A 
according to instance, let A = Z and the partition is given by Z/2Z/...2’"Z, then x^'^ = x\'^ + 

2x2^ + ... + 2’'“^a;^'^ and x\'^ = uY^Gm- When r —>■ oo, there exists a one-to-one mapping from u\'.^ to x^'^. 


Then we have 


Dp_Y' 


1 n I^Y.N /l:Af\ 7/ ll-.N P.N\ 

^ 2^ PjCl:Af^i^/l:N(X , t/ )d[y ,X ) 

J-I-.N ^y/l-.N 

^ E y') 

x,y' 



{y' - xf 

2A 


){y' 


xfdy' 


= A. 

The result Dp,y' = A is reasonable since the encoder does not do any compression. If we replace Ppi-.N yo-.N y'^'^) 

with Qiji.N Yii.N{u\'fY,y'^'^) and compress y'^'^ to at each level, the resulted average distortion Dq^y’ can 
be bounded as 

Dq,y'= E Quff ^y'^'^)d{y'^'^ ,M{u\'.^ Gn)) 


<— Pul:f,Y'^-^iUllr,y'^''^)diy'^''^,MiuY^GN)) 


+ ^ E \Pui:f ^y'^'^) — Quff jy'^'^)\d{y'^'^ Gn)) 

Since the densities of both F' and X decrease exponentially to their square norms, the distortion caused by large 
X or y' is negligible, we can always assume a maximum distortion dmax between y' and x. Then we have 


Dq,y' < Dp,y' + («l;., y , y ^)) • Nd, 

= A+ 0(2-^'"'), 
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where the equation follows from (|20] | and r = 0{loglogN) ifTTl Lemma 5]. 

Now we consider using the same encoder to quantize the Gaussian source Y. The resulted average distortion 
Dq y can be written as 


Dq,y = jz ^ Q^i-.n Yi N{u\\^,M{ u\.^Gn)) 


N 

1 

TV 


^ ^^^, 7W (m} if G w))■ 


Since the same encoder is used, for a same realization , we have Qjji-.Ny^i-.N {u\:^ 

and hence 

Dq,y - Dq^y' = ^ ^ {PY^-N{y^'-^) - PY’^-N{y^''^)) ■ Qui-.N\Yi.N{u\-.^\y^'^)d{y^'^G n)) 

< ^iV<ax ^ \PY^:N{y^--^) - PYn...{y^--^)\. 

yl:N 

Again, by the telescoping expansion, 


N 

= E E ( 2 /*) - Py' {y^-^-^)PYn+^:N {y^+^--^) 

yi-N i = l 

N 

i = l yi 

< N-8€A{aA)- 

Lemmafl] 


As a result. 


Dq,y < a + 0(2-^"') + 8eA(aA)dmaxA. (67) 

By scaling A, we can make eA(CTA) <C ^ and Dq y can be arbitrarily close to A with R > I{X;Y') > 
^ log ^ (n could be 1). When eA(o'A) —>■ 0, we have I{X;Y') —^ ^ log ^ and i? > ^ log 

Now it is ready to explain the lattice structure. From the definition of Ti and ifTTl Lemma 6], it is easy to find 
that C Ti-\ for 1 < £ < r. When is uniformly selected and = 0 at each level, the constructed polar 
code at level £ — 1 is a subset of the polar code at level £. Therefore, the resulted multilevel code is actually a polar 
lattice and the MAP decision on the bits in is a shaping operation according to Da,^^- Moreover, since Dq y is 
an average distortion over all random choices of there exists at least one specific choice of uf^ at each level 
making the average distortion satisfying dGTl i. This is exactly a shift on the constructed polar lattice. Consequently, 
the shifted polar lattice achieves the rate-distortion bound of the Gaussian source. □ 
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Proof of Theorem[5] 
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Proof: We firstly show that the target distortion can be achieved. Recall Ul'^ = Gn for each level 1. 
Let Pjji.N denote the joint distribution between and when the encoder performs no compression 

at each level, i.e., the encoder applies encoding rule (foT l for all indices i G [fV] at level 1, encoding rule (fTTI i for 

all i G [A^] at level 2 and similar rules for higher levels, with the notation X and Y' being replaced by A and 

_ 

X, respectively. Let Qjji.n ^i;w denote the joint distribution when only C/^ is recorded following the random 

j^Q 

rounding rule at each level. ‘ is a uniformly random sequence shared between the encoder and decoder, and 
is determined according to the MAP rule (see (fT4l i and (fTsT i'). As illustrated in (l20l i. 






( 68 ) 


Since r = O(logW), we can write Y{Pui:n = 0(2 ). When quantization is performed 

the source X, let Qiji.n x^-n denote the resulted joint distribution. By Lemma [T] again, 

u\:f! 

= X - Px^'^{x^''^)\ < N-8eA{ag). 

^l-.N 

It has been shown that eA{dq) = 0(2“^^), we further have 


for 


Y{Pur-N x^,N ^Quy-N x^'.n) < Y{Pijy.N x^.N ,Qjj1-.n x^,n) +Y{Qjj1-.n x^,n ,Qu1-.n x^.n) 

l:r ’ l:r ’ l:r ’ 1 ;t' ’ l:r ’ l:r ’ 


(69) 

(70) 


As mentioned in Section |V] the encoder only sends to the decoder, which then utilizes the side information 
to recover . Here we assume that ' can be correctly decoded and (7^ is recovered according to the 
MAP rule. In this case, the decoder enjoys the same joint distribution Qu^n as the encoder does. Recall that 
B = and B = ^Y. Let Qiji.n x^-n denote the resulted joint distribution of X^-^, and B^'-^ 

■^Q 

when the encoder performs compression, i.e., compresses X^'^ to ‘ at each level. Let Piji-.n x^-^f ^ denote 

the resulted joint distribution of , X^'-^, and B^'-^ when the encoder performs no compression for X^'-^. 


2Y{Pui:^ x^-^,X^-^ 

ujlf M'-N 

ujlf ,x^-^ M'-N 


(71) 
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According to Fig. |7] aqX' X ^ B and X ^ B ?at two Markov chains. We have 




N 

n 


exp(-^-2-)• 


=1 2ct2 


( 72 ) 

( 73 ) 


Therefore, 


Y{Pui.n xi.nbi-.n,Qu1:N = 0{2 ^ ). ( 74 ) 

Recall that the reconstruction of X is given by X = A + ^{B — A). The average distortion Ap caused by 
Pjj^ N x^-.N Bi-.N can be expressed as 

u\:^ 

where SA'^ = ■jA'^ + (1 — 'y)Ai{ui:^G n), where Ai{u]^.!^GN) is a mapping from to according to 
the lattice Gaussian distribution. Clearly, given u\':^, there is a one-to-one mapping between and a;^'^ when 
r is sufficiently large. Thus, Ap can be written as 


Ap = 


^ p pliAT „1:1V V.N~. 

— 2_^ Ir'jji-.N x^.N ,X , )'^[X ,X ) 




N 


Pxi-.N^X^-.N {x^''^, x^''^)d{x^''^ ,x^''^) 


*1:^7 ^l;iV 


N 


N^Px^x{x,x)d{x,x) 


[jxix) 

J X 


r+°° 1 

: exp(- —){x — xYdxdx 




= A. 


The expected distortion Aq achieved by Qjji.n ^i-.n satisfies 

Aq - Ap = — {Pul:N Xi'N^B^:N-Qjjl:N xl:N^B^:N)d{x'^'-’^,x'^'-’^) 

ul:^ ,x^--N 

yl:N 

= 0(2-^'^'). 

pC 

Now we show that the decoder is able to decode ^ with vanishing error probability. 


‘2'A{Pu1-.n QI.N ,Qij1-.n qi.n) — yj 

1 : r ’ 1 : r ’ r ^ 

|P(4y,0-Q(4^^,0| 



II 

M 

1 ~ X^'^, 1 


^l:W 


< y~! \Pu}'X .x^-^ ~ Qul'p .x^--^ 

^l-.N 

= 0(2-^"'). 


( 76 ) 
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By the result of CD, Pul :AT SI: AT Tesults 111 ail expectation of error probability Ep[Pe\ at each level such that 
Ep[Pe] = 0{2~^^ ). To see this, let Si denote the set of pairs of u]'^ and such that the SC decoding error 
occurs at the ith bit for level £, then the block decoding error event is given by S^ = Then the expectation 

of decoding error probability over all random mapping is expressed as 

4[Pe] = ^ e S^] 

< ^2 ^2 T’(7l:N € fi] 

zGXfuS® 


E 

p{uY-\ 


Upl-l, 

l:Ar 





t[P{v^,\uY-\ 

< P(w^© 1 


E 

p{uY~\ 

l:Af PL/® 

Upl-l, 

l:Ar 






-1 ipN p-.N) 


\ 

P{'u£i\uY-^ 

„.1:N p-.N\ 

Ppi-lP ) 


i-1 

5 





ill) 


= 0(2-^"'). 

Then by the union bound, we immediately obtain that the expectation of multistage decoding error probability for 
the polar lattice Ep[Pe] = 0{2~^^ ). Let P^^ denote the expectation of error probability caused by Qpi-.N pi-.N, 

(j,S 

i.e., it is an average error probability over all choices of the frozen bits ^ and shaping bits ^ for each level. 


Let £ denote the set of the pairs {u \\^such that a lattice decoding error occurs. We have 

p^^-Ep[Pe\ = Y. 

< 2V{Ppl-.N pl.N ,Qpl-.N p1,n) 

< 0(2-^'"'). 


(78) 

(79) 

(80) 


With regard to the data rate, we have 


and 


Finally, 


r i^Q 


N 


E 




N 


log 


^log 


all\ 




^-E —- —• 

r=i 


( 81 ) 

(82) 

(83) 

□ 
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Appendix E 
Proof of Theorem[6] 

Proof: We firstly show the power constraint P can be satisfied. Recall = A^'^Gn for each level £. Similar 

to the previous proof, denote by Pjji-.n fi-.N the joint distribution between and Tfjf when the encoder applies 

j-Q 

random rounding rule for all indices i £ [A^] at level £. Denote by Qjji.n the joint distribution when C/^ * and 

ds 

' are encoded following the random rounding rule at each level. U^‘- is a uniformly random sequence shared 

jjr gC 

between the encoder and decoder, ^ is a uniform message sequence and C/^ ^ is determined according to the 
MAP rule. 

Notice that ehicfq) < ej\{dc) = Similar to the previous proof, we have 

YiPjj^.N < 0{2-^^') + 0(2-'^) = 0 ( 2 -^'"'). ( 84 ) 


Thus, the average transmit power realized by Qiji-.n can be arbitrarily close to that realized by Piji-.n 
i.e., 


^ - P^^.N )]d{^ Gn),^''^) 


OLq 


< —Ndmax^{Pul'::f ) Quff 

< 0(2-^^'), 


(85) 

( 86 ) 
(87) 


where Gn) denotes a mapping from to according to the lattice Gaussian distribution. Note 

that for a constant cr^ and the assumption of a maximum distortion dmax is reasonable since the Gaussian 
distribution decays exponentially with large distortion. 

Now we show that the average transmit power realized by Pjji-.n fi-.N is arbitrarily close to P. When r is 
sufficiently large, PAi-r T = A + N{0, UqP) as shown in Fig. [TT] Then the variable ^-f^A + (A — 

f) = ^ A + N{0,aqP) corresponds to a variable resulted from adding a lattice Gaussian distributed variable 
to an independent Gaussian noise. Notice that only involves a scale on D^^al- When e\{dq) < 0(2~^), 

the flatness factor associated with the AWGN channel from to ^-A^A + N{Q, UqP) is also upper bounded 

by 0(2"^). 

Check that ( = (1 ~ Oiq)P. Let X and X denote i^^A + N{0,aqP) and Gaussian random variable 
with distribution A^(0, P), respectively. By Lemma[T] V(Py, P^) < 0{2~'^).Let = -^A4{uf.^GN) — d'^, 

we have 




d'-^)d{ 


1 




1 

N 


Pxi,Nd{x^-^ 

^l-.N 


0 ) 


— [a; ]. 


( 88 ) 

(89) 

(90) 
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Since Y{Px,Px) < 0(2 

Ex[x^] — Exlx"^] < 0(2 Consequently, 


Pt = 

A ^ Q!„ 

(91) 




< 

4 —+0(2-^'" ) 

(92) 




< 

Ex[x^] + 0(2-'^) + 0(2-^^') 

(93) 

= 

P + 0(2-^^'). 

(94) 


Now we prove the reliability. Recall that Y = S + X -\- Z, where X = S' — pS is independent of S. Scaling Y 
by p gives us 

pY = pS' + p(l — p)S + pZ (95) 

= acS' + p(l — p)S — (etc — p)S' + pZ. (96) 


Check that 


leaving us etc — p = 


Note that both pS — 


— 


P{P + <yl) 

Paf + {P + aff- 


= (1 - aq)p. 


aq{l — p). Scale pY by —. Then 


— pY = aqS' + —{1 — p){pS — UqS') + —pZ. 

etc etc ac 

aqS' and Z are independent of S'. Replacing etgS" with A, we have 


(97) 


(98) 


^pY = A+^{l-p){pS-A) + ^pZ, (99) 

which corresponds to the reverse solution shown in Fig. [TT] Let Y denote the channel output when S is replaced 
by S, i.e., 


^pY = A+^{1- p){pS-A) + ^pZ. (100) 

Recall T = pS, T = pS, and B = ^pY. Also let B = ^pY. According to the previous proof, we already have 
nPul :N ^i:A^, ) Si f9(2 ). hlote that Z is an independent Cjaussian noise, it is not difficult to obtain 

that 




'V(^Pljl:N Jil-.N gl-.N j QjjI.N rpl.N gl-.N) — 'V {Pjjl-.N ^rpi.,N ^ Q jjl.N ^rpl-.N ) S 0(2 ), 

since P^i.n^iji-.n = Q rpi-.w- Ry (176b . 


-Nf^' 1 


V(Pyl;N^^l:iV , QjjI-.N Jjl.N) S V(L[/1:N 2'l:iV ^^1 ;N , Q jjl: N rpl. N Jjl. N ) S 0(2 ) 


( 101 ) 


( 102 ) 


Note that B is a variable obtained by adding A to a Gaussian noise, and B is the real signal received because 
the side information S is Gaussian distributed. Similar to (ITTT i. the expectation of error probability Ep[Pe] caused 
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by Pul ■.N gi.N can be unpper bounded as Ep[Pe\ < 0(2 ). Finally, the expectation of error probability 

caused by Qjji.n gi.N satisfies 



(103) 


(104) 


□ 
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